Tag

#image generation

27 articles

Alibaba's Qwen-Image-3.0 renders full infographic grids and readable ten-pixel text in a single pass

Alibaba's Qwen-Image-3.0 introduces advanced image generation capabilities, including support for 4,500-token prompts, readable ten-pixel text, and complex layout rendering in a single pass.

Jul 21 · 7

tech

Google Search now generates AI images when it can't find what you're looking for on the web

Learn to build an AI image generation system similar to Google's new Search feature that creates images from text prompts when web results are insufficient.

Jul 14 · 6

Midjourney wants Hollywood studios to reveal the details of their AI usage

Learn to create your own AI image generator using Python and Stable Diffusion. This beginner-friendly tutorial teaches the fundamentals of AI image generation technology.

Jul 4 · 45

Google launches Nano Banana 2 Lite for fast AI images and Gemini Omni Flash for video via API

Google launches Nano Banana 2 Lite for fast AI image generation and Gemini Omni Flash for video via API, enabling rapid content creation workflows.

Jun 30 · 49

Gemini’s personalized AI image generation is now free for US users

Learn how AI image generation can create personalized pictures based on your interests and online activities. Understand what this technology means for privacy and personalization.

Jun 29 · 38

research

Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators

Microsoft Research's Lens demonstrates that a model trained on high-quality data, such as detailed captions from GPT-4.1, can outperform larger models trained on generic data. The open-source model achieves benchmark results with just 3.8 billion parameters.

Jun 8 · 51

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ties with Google's Nano Banana 2 on Arena's leaderboard, showing significant improvements over its predecessor.

May 27 · 54

Google DeepMind Introduces Vision Banana: An Instruction-Tuned Image Generator That Beats SAM 3 on Segmentation and Depth Anything V3 on Metric Depth Estimation

Learn how to set up and use Google DeepMind's Vision Banana model for image segmentation and depth estimation tasks.

Apr 24 · 78

I tried ChatGPT Images 2.0: A fun, huge leap - and surprisingly useful for real work

ChatGPT Images 2.0 delivers impressive improvements in accuracy and text handling, making it a useful tool for branding and infographics, though occasional inaccuracies remain.

Apr 24 · 66

Introducing ChatGPT Images 2.0

OpenAI introduces ChatGPT Images 2.0, a major upgrade to its image generation model featuring improved text rendering, multilingual support, and advanced visual reasoning capabilities.

Apr 22 · 82

OpenAI Beefs Up ChatGPT’s Image Generation Model

OpenAI has released ChatGPT Images 2.0, an upgraded image generation model that produces more detailed images and better text rendering, though it still struggles with non-English languages.

Apr 21 · 71

ChatGPT’s new Images 2.0 model is surprisingly good at generating text

OpenAI's ChatGPT Images 2.0 demonstrates significant advances in AI capabilities, particularly in text integration within generated images.

Apr 21 · 69